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1.  Introduction. 


The  problem  of  finding  all  minimal  Markovian  (state  space)  representations  of  a given 
random  process  {y(t);  t £ T } is  known  as  the  stochastic  realization  problem . It  has  been 
studied  extensively  in  recent  years  both  in  its  deterministic  [2,3,15]  and  its  probabilistic 
[1.4-6,9,10,12-14]  aspects.  In  this  paper  we  extend  and  unify  the  axiomatic  state  space 
approaches  presented  in  [5]  and  [14]. 

The  notion  of  minimal  splitting  subspace,  a generalization  of  a concept  introduced 
in  [7] , was  applied  to  this  problem  in  [9]  and  [5] . This  is  a natural  approach,  for,  at 
any  given  time  t £ T,  a minimal  splitting  subspace  (with  respect  to  the  spaces  spanned 
by  the  past  and  the  future  of  y)  can  be  interpreted  as  a subspace  of  smallest  size  cqntain- 
ing  all  the  information  from  the  past  needed  in  predicting  the  future  and  all  the  informa- 
tion in  the  future  required  to  estimate  the  past,  making  it  an  obvious  candidate  for  a 
state  space.  A dynamical  state  space  description  will  then  require  considering  families 
{Xj,  t £ T)  of  minimal  splitting  subspaces.  However,  as  we  shall  see  below,  an  arbitrary 
such  family  will  not  in  general  yield  a stochastic  realization.  For  this  to  be  the  case,  we 
need  to  impose  a natural  growing  condition. 

We  begin  this  paper  by  solving  the  general  problem  of  finding  all  minimal  splitting 
subspaces  with  respect  to  two  arbitrary  subspaces.  This  is  a geometric  problem  in 
Hilbert  space,  and  it  contains  as  a special  case,  the  splitting  subspace  problem  of  the 
stochastic  realization  problem.  Secondly  we  apply  these  results  to  the  problem  of  find- 
ing all  Markovian  representations  of  an  arbitrary  stochastic  vector  process. 


2.  Some  preliminaries  and  notations 


Let  H be  a Hilbert  space.  For  two  subspaces  A and  B of  H (all  subspaces  are  taken 
to  be  closed),  A 1 B means  that  A and  B are  orthogonal,  A e B denotes  direct  sum, 

A © B is  the  subspace  of  A orthogonal  to  B,  A V B is  the  closed  linear  hull  of  A and  B, 
and  A^  is  the  orthogonal  complement  of  A in  H.  The  orthogonal  projection  of  X £ H 
onto  A is  denoted  E^X  or  E {X|A}.  Let  E^B  or  E {B|A  }be  the  smallest  subspace 
containing  E^B,  i.e.,  the  closure  of  E^B. 

We  shall  write  A 1 B IX  if  the  three  subspaces  A,  B and  X satisfy  the  condition 


(a,0)  = (EXa,  EX0)  for  alla£A,/J€B. 
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It  can  be  seen  that  (1)  is  equivalent  to  each  of  the  conditions 


E{a|B  V X}=  E{a|X}for  all  a€A 
E{0|A  VX}=  EMX}  for  all  P £B 


and,  if  X C A,  also  to 


A©  XJB. 


3.  Minimal  splitting  subspaces 

Let  Hi  and  H2  be  two  arbitrary  subspaces  of  H and  define  Hq  to  be  the  vector  sum 
of  these,  i.e.,  Hq  = Hi  V H2. 

Definition  1.  A subspace  X C H is  a splitting  subspace  (with  respect  to  Hi  and  H2)  if 

Hi  1H2IX. 

A splitting  subspace  X is  minimal  if  there  is  no  proper  subspace  of  X satisfying  (5)  and 
internal  if  X C Hq. 

The  following  lemma  will  be  useful  in  testing  the  minimality  of  a splitting  subspace. 

Lemma  1.  Let  X be  a splitting  subspace  and  assume  that  X = X 1 © X2  where  X 1 and  X2 
are  subspaces  of  X.  Then  X 1 is  a splitting  subspace  if  and  only  if 

X,  Xo 
E 2Hi1E  2H2 

Proof.  (Cf  (14J.)  Take  X © Hi  and  77  € H2.  Then,  in  view  of  (5), 

(X.TJ)  = (EXX,  EXt?)  = (EXlX,  EX\)  + (EX2X,  E^t,). 

X X 

Hence,  (6)  is  equivalent  to  (X,tj)  = {E  ^X,  E ^rj)  for  all  X £ Hi  and  r?  € H2,  which  is  the 
same  as  Hi  1 H2IX1.  ■ 

The  purpose  of  this  section  is  to  solve  Problem  PI. 


Problem  PI.  Find  all  minimal  splitting  subspaces. 


Let  us  also  consider  the  following  problem. 

Problem  P2.  Find  all  splitting  subspaces  X such  that 

x n h/=  0 

(7) 

X n h21  = 0 

Remark  1.  Using  the  orthogonal  decomposition 

(8) 

A * EfBlAjefAnB1), 

(9) 

which  holds  for  any  subspaces  A and  B of  H,  we  see  that  (7)  and  (8)  are  equivalent  to 

X = E{H1|X}  (7)' 

and 

X=E{H2IX}  • (8)' 

respectively. 

Proposition  1.  Problem  PI  and  P2 are  equivalent. 

Proof,  (i)  Let  X be  a solution  of  PI.  Using  (9)  we  can  write  X * Xi  ® X2  where  X^  * E*Hj 

and  X2  = XnH]\  Obviously  E = 0,  and  therefore  (6)  holds.  Hence  X^  is  a splitting 

subspace  (Lemma  1).  Then,  since  X is  a minimal  splitting  subspace,  Xj  = X,  and  thus  (7) 

holds.  In  the  same  way  we  show  (8).  Therefore  X is  a solution  of  P2. 

(ii)  Let  X be  a solution  of  P2,  and  let  X-|  be  any  splitting  subspace  contained  in  X.  To 

see  that  X is  a solution  of  PI  it  remains  to  show  that  X2  X G X<  =0.  By  Lemma  1,  (6) 

— —X  X —X 

holds.  Now,  in  view  of  X2  C X and  condition  (7)',  E = E ^E  Hi  * E ^X  = X2. 

In  the  same  way,  using  (8)',  it  can  be  shown  that  E *^H2  = X2,  and  therefore,  due  to  (6), 

X2  = 0.  ■ 

From  Proposition  1 we  see  that  our  main  problem  PI  can  be  replaced  by  the  mathemati- 
cally more  convenient  problem  P2.  To  solve  P2  we  shall  first  study  minimal  splitting 
subspaces  satisfying  one  of  the  conditions  (7)  and  (8),  say  (7). 


. 


Lemma  2.  The  subspace  X C H is  a splitting  subspace  satisfying  (7)  if  and  only  if 
X = E { H -j  | S } for  some  subspace  S3  H2. 

Proof.  (Cf.  16].)  (if):  Let  X = E {H, |S } where  S D H2.  Then  ESX  = EXX  for  each  X £ H, 

H V X 

Since  S D H2  V X,  this  implies  that  E 2 X = EXX  for  each  X 6H,,  i.e.,  in  view  of  (2), 

H j 1 H2|X.  Hence  X is  a splitting  subspace  satisfying  (7)'. 

(only  if):  Let  X be  a splitting  subspace  satisfying  (7)'.  Then,  X = EXH1  = E 2 H^, 

by  condition  (2),  and  hence  X = E {H^S}  with  S = H2  V X D H2.  ■ 

Then,  by  introducing  the  additional  condition  (8),  we  obtain  the  following  theorem 
which  gives  the  solution  of  PI.  For  this  we  first  need  to  define  the  frame  space 

H°=  E{H1|H2}VE{H2|H1}.  (10) 

which  is  itself  a (nonminimal)  internal  splitting  subspace  [5]. 

Theorem  1.  The  subspace  X is  a minimal  splitting  subspace  if  and  only  if  X = E {H 1 1S  } 
for  some  subspace  S such  that  H2CSC(H2VHd)©HqJ-. 

We  need  the  following  lemma  to  prove  this  theorem. 

Lemma  3.  Let  S D H2.  Then  ES  (H1  n H^)  = (ESH1)  n H^. 

Proof.  By  definition,  ES  (H ^ n H2X)  = {ESX  | X € H , and  X 1 H2}.  But  X 1 H2  is  equivalent 
to  E 2X  = 0,  which,  in  view  of  S D H2,  is  the  same  as  E 2 E X = 0 or  E^X  £ H^.  ■ 

Proof  of  Theorem  1.  In  view  of  Lemma  2 and  condition  (8),  it  only  remains  to  show  that 
S C (H2  V H°)  © Hq-1-  if  and  only  if  (E^Hj)  O H^  = 0.  But,  by  Lemma  3,  the  latter 
condition  is  equivalent  to  S C (H1  n Hj1)1.  Now,  using  formula  (9),  it  is  seen  that  Hq  = 

(H2  V H°)  © (Hi  O H2-*-),  and  therefore  (H1  n * (H2  V H°)  © Hq-1-.  ■ 

Corollary  1.  The  subspace  X is  an  internal  minimal  splitting  subspace  if  and  only  if  X = 

E {Hj  IS}  for  some  subspace  S such  that  H2  C S C H2  V H°. 

Theorem  1 provides  a parameterization  of  the  set  of  minimal  splitting  subspaces. 

We  shall  now  show  that  the  mapping  S-*X  is  one  to  one  if  and  only  if  S is  restricted  to 
Hq,  in  which  case  we  obtain  precisely  the  interna!  minimal  splitting  subspaces. 


Proposition  Z Let  S be  a subspace  such  that  SDH2,  and  define  X - E {H  ^ IS  }.  Then 
S = (Hj  V X)  © K for  some  subspace  K C Hq-1.  If  SC  Hg,  K = 0. 

Proof.  Clearly  S D H2  V X,  and  therefore  S = (H2  V X)  © K for  some  subspace  K.  We  will 

S X 

show  that  K C HrA  To  this  end,  as  in  the  proof  of  Lemma  2,  first  note  that  E X = E X 

H V X 

for  each  X G Hj.  Since  S D H2  V X,  this  implies  that  ESX  = E 2 X for  all  X G H -| , and 
consequently  K 1 H^.  Since,  in  addition  K 1 H2  (by  definition),  K 1 Hq.  ■ 

4.  Applications  to  the  stochastic  realization  problem 

Let  {y(t);tGT}  be  a wide  sense  separable  [11]  centered  m-dimensional  Gaussian 
stochastic  process  defined  on  a probability  space  (£ 2 , fj  P)  and  with  a time  set  T which 
is  either  discrete  or  continuous.  For  any  (complete)  sigma-field  S C F define  the 
Gaussian  space  H(S)  to  be  the  subspace  of  L2(£2,  F,  P)  consisting  of  all  centered  S- 
measurable  real  Gaussian  stochastic  variables  in  (£2,  F,  P)  [8] . The  space  H(  F)  will 
play  the  role  of  H in  Section  3;  it  is  a Hilbert  space  with  inner  product  (£,77)  = E{£r?l 
where  E{  ■ } denotes  mathematical  expection.  To  simplify  matters,  we  shall  write  H(y) 
instead  of  H(o{y(t) ; t G T»,  and  we  shall  adhere  to  this  convention  whenever  S is 
induced  by  a random  process.  It  is  well-known  [8]  that  H(y)  = spty^t);  t G T,  k = 

1,2,..., m),  where  sp{  • } signifies  the  closed  linear  hull  in  L2(£2,  F,  P). 

To  exploit  the  theory  of  minimal  splitting  subspaces  developed  in  Section  3,  we 
need  to  define  the  past  field  V f and  future  field  V * of  the  process  y for  each  t G T.  . 
Here  we  shall  take  andy*  to  be  the  sigma-fields  generated  by  (y(s) ; s < t) 
and  (y(s);  s>  t)  respectively  (take  a {<J>}  to  be  {<{>,  £2}),  but  other  definitions  are 
possible,  we  may  take  s < t instead,  and  in  some  applications  it  is  better  to  let  and 
y*  be  generated  by  the  past  and  future  increments  of  y [5,  13] . Then,  for  each  t G T, 
define  the  past  space  H^"  (y)  and  the  future  space  (y)  to  be  the  Gaussian  spaces 
H(yp  and  H(y  ^)  respectively.  In  the  setting  of  Section  3 the  subspaces  H(y),  (y) 

and  (y)  will  play  the  parts  of  H0,  H1  and  H2  respectively. 

Definition  2.  A (Gaussian)  stochastic  system  on  (£2,  F,  P)  is  a pair  (x,y)  of  centered 
(jointly  Gaussian)  stochastic  processes  (x(t);  t G T]  and  {y(t);  t G T},  taking  values 
in  an  arbitrary  real  vector  space  E and  in  Rm  respectively,  such  that,  for  every  t G T, 


the  Gaussian  subspace*  Xt  generated  by  the  random  vector  x(t)  satisfies 

[H^(y)  VXf]  l[H+(y)  VX+]|Xt.  • {11) 

where  X^"  :=  Vj^tXs  and  X*  = The  Processes  x anc*  Y are  called  the  state  process 

and  the  output  process  respectively,  and  Xt  is  the  state  space  at  time  t.  The  stochastic 
system  is  finite  dimensional  if  dimXt  < °°  for  all  t £ T. 

Remark  2.  Trivially,  the  family  (Xt;  t E T}of  state  spaces  satisfies  the  conditions 

(i)  {Xt;  t £ T}is  Markovian,  i.e.  X+  1 X^*  |Xt  for  all  tGT,  This  is  equivalent 

to  saying  that  (x(t);  t E T}  is  a Markov  process. 

(ii)  for  each  t E T,  Xt  is  a splitting  subspace  with  respect  to  H^(y)  and  (y).  ■ 

We  shall  say  that  two  stochastic  systems  (defined  on  the  same  probability  space) 
ate  equivalent  if,  for  each  t E T,  their  output  processes  agree  a.s.  and  their  state  spaces  are 
the  same.  Hence  equivalent  stochastic  systems  can  have  different  state  processes  but 
these  are  related  by  trivial  coordinate  transformations  in  the  state  spaces. 

As  an  example  let  us  consider  a discrete-time  stochastic  system  with  T = Z+, 

Proposition  3.  All  finite  dimensional  stochastic  systems  (x,y)  with  T = Z+  have  a 
representation  of  type 

x(t  + 1)  = A(t)x(t)  + B(t)w(t)  ; x(0)  = x0  (12a) 

y(t)  = C(t)x(t)  + D(t)w(t).  (12b) 

where  {A(t),  B(t),  C(t),  D(t);t  E Z+}  are  matrices  of  appropriate  dimensions,  xq  is  a 
zero-mean  Gaussian  random  vector,  and  w is  a unitary  Gaussian  white  noise  process 
independent  of  Xq.  Conversely,  any  pair  (x,y)  of  stochastic  processes  satisfying  (12) 
is  a stochastic  system. 

Proof,  (i)  Let  (x,y)  be  a stochastic  system  with  T = Z+  and  the  state  process  x taking 
values  in  Rn.  We  shall  prove  that  (x,y)  satisfies  a representation  (12).  To  this  end  first 
note  that 


t 


That  is,  Xt  :=  H(Xt),  where  X t is  the  sigma-field  induced  by  the  random  vector  x(t). 


(X(t+1)L  EH-(y)VX~  |x<t+1> I +E[H-(y)VX^]1  (X<t+1) 

V y<t>  / y(t)  / \ y(t )l 


Now,  (11)  implies  that  EH*  \ = EXtX  for  ail  X£Hj  (y)VX*  (property  (2)),  and 

consequently  there  are  matrices  A(t)  and  C(t)  such  that 


H“(y)VX-  /x(t+1A 

I V <t)/  = 


The  second  term  of  (13)  is  a white  noise  process;  it  is  the  innovation  process  of  |x<t+‘I) 
By  normalizing  we  obtain  ' ; 


;[H-(y)VX^] 


1 [x<t+1'\  J B(t,l 

\ y(t)  J \D(t)J 


( 

)fB(t)\ 

\D(t)| 


where  w is  a unitary  Gaussian  white  noise  and  B(t)  and  D(t)  are  matrices  such  that  J 

ID(1 

has  full  rank.  Hence  (x,y)  satisfies  (12).  It  remains  to  show  that  Xq  1 H(w).  But,  since 
has  full  rank,  (15)  implies  that  w(t)  € [H^“  (y)VX^T]-*-  for  all  t E Z+.  Hence  Xq  1 H(w). 
(ii)  Assume  that  (x,y)  satisfies  (12),  and,  for  each  t£  Z+,  let  Xt  be  the  Gaussian  space 
generated  by  x(t).  Since  w is  a white  noise  process  and  xq  1 H+ (w),  Xq  ® Hpw)  is 
orthogonal  to  H^ (w)  or  equivalently 

(X0®H-(w)J  l[H+(w)©Xt]|Xt 

(property  (4)).  From  (12)  it  is  easy  to  see  that  H^"  (y)VX^-  C Xq  ® H^-  (w)  and  that 
Hj  (y)VX^  C H* (w)  ® Xt  and  therefore  (11)  holds.  Hence  (x,y)  is  a stochastic  system. 


Similar  results  hold  for  continuous-time  processes  and  stationary  processes  defined  on 
the  whole  real  line. 

Definition3.  A stochastic  realization  (or  Markovian  representation)  of  (y(t);t£  i } on 
(f2,F,  P)  is  a stochastic  system  (x,z)  on  (£2,F,  P)  such  that,  for  all  t£T,  z(t)  = y(t)  a.s. 

The  realization  is  said  to  be  finite  dimensional  if  the  stochastic  system  is  finite  dimensional 
and  internal  if  Xt  C H(y)  for  t £ T. 

Theorem  2.  Every  wide  sense  separable  centered  Gaussian  stochastic  process  {y  ( t) ; t£T} 
has  a stochastic  realization  on  [El,  V , P)  where  V is  the  sigma-field  generated  by  the  process  y. 


Proof.  For  each  t G T,  set  Xt  = Ht  (y).  Since  trivially  Ht  (y)lH(y)|Ht  (y),  Xt,  thus 
defined,  satisfies  (11)  for  all  t £ T.  For  each  t G T,  choose  a basis  x(t)  in  Xt;  this  can  be 


k 


done  due  To  the  separability.  Then  (x,y)  is  a stochastic  realization  on  (£2,  X P).  Of  course, 
we  may  as  well  set  Xt  = H*(y).  ■ 

The  basic  problem  in  realization  theory  is  to  find  the  smallest  possible  state  spaces.  Of 
course,  in  general,  the  chok  a in  the  proof  of  Theorem  2 will  not  suffice  for  this  purpose.  We 
need  a concept  of  minimality.  To  this  end  first  note  that  the  family  H of  all  subspaces  of 
H(  F ) is  a partially  ordered  set  with  respect  to  vector  space  inclusion. 

Definition  4.  A stochastic  realization  (x,y)  on  (£2,F,  P)  is  minimal  at  time  t if  the  correspond- 
ing state  space  Xt  is  minimal  with  respect  to  the  ordering  of  the  partially  ordered  set  H,  i.e.  if 
there  is  no  smaller  subspace  of  H(  F ) for  which  ( 1 1 ) holds.  We  say  that  (x,y)  is  minimal 
if  it  is  minimal  at  each  time  t £ T. 

Proposition  4 (Ruckebusch).  A stochastic  realization  is  minimal  if  and  only  if  it  is  both 
observable  and  constructible,  i.e.,  for  every  t £ T, 


XtO  [H+(y)]1=0 

(observability) 

(17) 

Xtn  [H-(y)]1=  0 

(constructability). 

(18) 

The  proof  of  this  result  is  a trivial  generalization  of  one  given  in  [14]  for  the  stationary 

case. 

Minimal  splitting  subspaces  with  respect  to  the  past  and  future  spaces  of  y are  natural 
candidates  for  state  spaces  in  the  stochastic  realization  problem.  If  we  consider  splitting 
subspaces  contained  in  the  past  space  or  in  the  future  space  only,  the  Markov  property  is  a 
direct  consequence  of  the  splitting  property.  In  general,  however,  we  must  impose  a 
certain  growing  condition,  as  described  in  the  following  theorem. 

Theorem  3.  A family  (Xt;t£T)  of  separable  subspaces  of  H(F)  defines  a minimal  stochastic 
realization  (x,y)  (in  the  sense  that  Xt  is  the  Gaussian  space  generated  by  x(t)  for  each  t £ T) 
if  and  only  if 

(i)  for  each  t £ T,  Xt  is  a minimal  splitting  subspace  with  respect  to  H^"  (y)  and 
Ht+(y); 

(ii)  there  exists  a family  {Zt;  t £ T } of  subspaces,  with  the  property  Zt  C [H(y)VX+] 
for  all  t £ T such  that  the  family  {St;  t £ T } , where  St:=  [Ht  (y)VXt)  $ Zt,  is 
nondecreasing,  i.e.  S,.  -C  Sj  whenever  t < t 


Proof,  (if):  By  Condition  (i).  Lemma  2,  and  Proposition  2,  Xt=  E {H*(y)|[H“  (y)VXt]  © Kt} 

where  K£iHt+(y).  Therefore,  since  in  addition  ZtiH+(y),  Xt  = E {H+ (y)|St}.  Consequently, 
in  view  of  (9), 

St©Xt  = Stn[H  + (y)]-L.  (19) 

which  is  nondecreasing  in  t,  for  {St}  and  {[H+ (y)]  x}  are.  Hence,  since  trivially  (St  © X£)  1 Xp 
(S£  © X£)  1 X^.  By  { 19),  we  also  have  (S^  © X£)  1 H^"  (y),  and  therefore,  in  view  of  property 
(4),  Stl  [H+(y)VX*]  |Xr  But,  due  to  condition  (ii),  H^-  (y)VX^  C St,  and  consequently 
(11)  holds.  Then,  choosing  a basis  x(t)  in  X£  for  each  tE  T produces  the  required  state 
process.  The  stochastic  realization  (x,y)  is  minimal,  for  {Xt;  tET}  is  a family  of  minimal 
splitting  subspaces. 

(only  if):  Let  (x,y)  be  a minimal  stochastic  realization  with  state  spaces  {Xt;  t£T}.  Then, 
Condition  (i)  follows  from  Remark  2 (ii)  and,  as  for  the  minimality,  from  Proposition  1 
and  4;  note  that  conditions  (7)  - (8)  and  (17)  - (18)  are  identical.  To  show  that  Condition 
(ii)  holds,  set  Zt  :=  [H“  (yjVX^  ] © [H~  (y)VXt) . Then  St  = H~  (y)VX^  which  is  non- 
decreasing. It  remains  to  show  that  Zt  C [HfyjVX^]1.  It  follows  from  (1 1),  (4)  and  the 
definition  of  Zt  that 

ZtC  [H-(y)VX-]  1 {[H  + (y)VX+]  ©Xt}.  (20) 

Since,  in  addition  Ztl  [H~ (y)VXt] , Ztl[H(y)VX+]  as  required.  ■ 

We  obtain  a simpler  version  of  this  theorem  by  restricting  our  attention  to  internal 
realizations. 

Corollary  3.  A family  {X£;  tET)  of  subsDaces  of  H(y)  defines  a minimal  stochastic 
realization  of  y if  and  only  if  Condition  (i)  of  Theorem  3 holds  and  the  family  {St;  tET), 
where  here  S£  :=  H£  (y)VXt,  is  nondecreasing. 

Proof,  (if):  Noting  that  H(y)  is  separable,  this  part  follows  immediately  from  Theorem  3. 

(only  if):  It  only  remains  to  show  that  Zt  = 0 for  all  tET.  But,  by  definition, 

Zt  C Ht  (Y>Vxt  C H(vK  Therefore,  in  order  to  satisfy  the  condition  Z£  C [H^VX*]1, 
we  must  have  Zt  = 0.  ■ 

Comparing  Corollaries  1 and  3 we  can  see  from  Proposition  2 that,  in  general,  there 
exist  families  of  (internal)  minimal  splitting  subspaces  which  do  not  evolve  in  time  in  a 


Markovian  fashion.  To  see  this,  we  may  choose  a family  {St;  tGT}  which  is  not  non- 
decreasing. 


' 
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